Channel and noise normalization using affine transformed cepstrum

Authors

  • Xiaoyu Zhang
  • Richard J. Mammone
Abstract

This paper addresses the environmental mismatch problem that arises from noise and channel variability. A new feature mapping technique based on an optimal affine transform of the cepstrum is proposed to solve the mismatch problem observed in speaker recognition systems. It is designed based on the fact that both channel and noise interference basically cause the cepstrum space to undergo an affine transformation. By taking an inverse transformation, we can easily decouple the effects of the channel and noise from the speech. Alternatively, we can take a forward transform of the training data to simulate the operating conditions.
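The affine model described in the abstract can be illustrated with a small sketch. It assumes a distorted cepstral vector y is related to a clean vector x by y = A x + b, so the inverse mapping is x = A^{-1}(y - b). The abstract does not say how the optimal transform is estimated; the least-squares fit on paired clean and distorted frames below is an assumption made for illustration, and the names `apply_affine`, `invert_affine`, `fit_affine`, and `solve_linear` are hypothetical.

```python
def solve_linear(M, rhs):
    """Solve M z = rhs by Gaussian elimination with partial pivoting."""
    n = len(M)
    # Work on an augmented copy so the caller's matrix is not modified.
    aug = [[float(v) for v in row] + [float(r)] for row, r in zip(M, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular or ill-conditioned")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution on the upper-triangular system.
    z = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (aug[i][n] - s) / aug[i][i]
    return z


def apply_affine(A, b, x):
    """Forward transform: y = A x + b (e.g. simulate operating conditions)."""
    return [sum(a * xj for a, xj in zip(row, x)) + bi for row, bi in zip(A, b)]


def invert_affine(A, b, y):
    """Inverse transform: x = A^{-1} (y - b), without forming A^{-1} explicitly."""
    return solve_linear(A, [yi - bi for yi, bi in zip(y, b)])


def fit_affine(clean, distorted):
    """Least-squares A, b with distorted ~= A clean + b (assumed estimator)."""
    d = len(clean[0])
    aug = [list(x) + [1.0] for x in clean]  # append 1 to absorb the offset b
    gram = [[sum(row[i] * row[j] for row in aug) for j in range(d + 1)]
            for i in range(d + 1)]
    A, b = [], []
    for k in range(len(distorted[0])):
        rhs = [sum(row[i] * y[k] for row, y in zip(aug, distorted))
               for i in range(d + 1)]
        w = solve_linear(gram, rhs)  # one row of A plus one entry of b
        A.append(w[:d])
        b.append(w[d])
    return A, b


if __name__ == "__main__":
    # Toy 2-dimensional "cepstra"; all values are made up for illustration.
    true_A = [[0.9, 0.1], [-0.2, 1.1]]
    true_b = [0.5, -0.3]
    clean = [[1.0, 2.0], [0.0, -1.0], [2.5, 0.5], [-1.0, 1.5]]
    distorted = [apply_affine(true_A, true_b, x) for x in clean]
    A, b = fit_affine(clean, distorted)
    restored = [invert_affine(A, b, y) for y in distorted]
    print(A, b)
    print(restored)
```

In this sketch, `invert_affine` corresponds to removing the channel and noise effects from test features, while `apply_affine` corresponds to transforming training data toward the operating conditions.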

Similar resources

Blind normalization of speech from different channels

We show how to construct a channel-independent representation of speech that has propagated through a noisy reverberant channel. This is done by blindly rescaling the cepstral time series by a nonlinear function, with the form of this scale function being determined by previously encountered cepstra from that channel. The rescaled form of the time series is an invariant property of it in the fo...

Channel compensation of modulation spectral features

We propose a new channel compensation method for modulation spectral features. We compare our proposed method, subband normalization, with a more traditional method, cepstral mean subtraction (CMS). Experimental results show that subband normalized modulation scale features provide advantages over CMS features. The proposed method is not only robust to slowly varying convolutional noise, but al...

Automatic Segmentation of Speech Recorded in Unknown Noisy Channel

This paper investigates the problem of automatic segmentation of speech recorded in noisy channel corrupted environments. Using an HMM-based speech segmentation algorithm, speech enhancement and parameter compensation techniques previously proposed for robust speech recognition are evaluated and compared for improved segmentation in colored noise. Speech enhancement algorithms considered includ...

Noise and Channel Normalized Cepstral Features for Far-speech Recognition

The paper analyses suitable features for distorted speech recognition. The aim is to explore the application of a command ASR system when the speech is recorded with far-distance microphones with possibly strong additive and convolutional noise. The paper analyses the feasible contribution of basic spectral subtraction coupled with cepstral mean normalization in minimizing the influence of present...

Maximum likelihood normalization for robust speech recognition

It is well known that additive and channel noise cause shift and scaling in MFCC features. Empirical normalization techniques to estimate and compensate for these effects, such as cepstral mean subtraction and variance normalization, have been shown to be useful. However, these empirical estimates may not be optimal. In this paper, we approach the problem from two directions, 1) use a more robust ...

Publication date: 1996